Abstract: This article considers the relation of the Polya-Aeppli probability distribution (also known as
the Poisson-Geometric distribution) with random processes in the stationary case. Working
with probability generating functions, the distribution is then derived as the limit of the
distribution of rare events in a succession of Bernoulli trials with first-order Markov
dependence (that is, as a limit of Markov chains of rare events).
Introduction


In a scheme of independent Bernoulli trials, let $N_n$ be the random number of events $E$
in $n$ consecutive trials. If $p$ is the probability of $E$ in a single trial and $q$ is the
probability of the complementary event $E^c$, also in a single trial (so $p+q=1$), it is well
known that the probability $P_n(k)=P(N_n=k)$ is given by $C_n^k\,p^k q^{n-k}$, where $C_n^k$ is the
number of combinations of $n$ things taken $k$ at a time. The expected value of the random
number of events $E$ is $E[N_n]=n\,p$.


To ease the following discussion, let us denote by $B(p;n)$ a finite Bernoulli
scheme of $n$ consecutive and independent trials, with probability $p$ of the event $E$
occurring in a single trial. Now, let us consider a succession $\{B(p(n);n)\}$ of finite
Bernoulli schemes of independent trials ($n=1,2,3,\ldots$). If the sequence $\{p(n)\}$ is such
that the mean value $E[N_n]=n\,p(n)$ of $N_n$ tends to a constant $\mu$ while the number $n$ of
consecutive trials increases without bound, then it is well known that the probability
$P_n(k;p(n))=P(N_n=k;p(n))$, $(k=0,1,2,\ldots,n)$, tends to a Poisson distribution when
$n\to+\infty$.

The method of probability generating functions allows us to derive Poisson's
distribution as a limit distribution in a framework that can be easily generalized to the
case of the distribution of rare events along a sequence of homogeneous Markov chains.


Let us assume that $p(n)=\frac{1}{n}(\mu+\varepsilon_n)$, with $\varepsilon_n\to 0$ when $n$ increases without bound, so
$E$ becomes increasingly rare. In this case the probability generating function
$g_n(z)=\sum_{k=0}^{n}P_n(k;p(n))\,z^k$ of the random variable $N_n$ is:

$$g_n(z)=\bigl(q(n)+p(n)\,z\bigr)^n=\left[1+\frac{(\mu+\varepsilon_n)(z-1)}{n}\right]^n \tag{1}$$


When $n$ increases, $g_n(z)$ approaches the probability generating function of a Poisson
distribution, $g(z)=e^{\mu(z-1)}$. A fairly general theorem of probability theory [1] allows us
to conclude that if a sequence of probability generating functions $g_n(z)$ converges to a
probability generating function $g(z)$, then the corresponding sequence $P_n(k)$ converges to
$P(k)$, the distribution of probabilities of the random variable characterized by $g(z)$.
Hence the distribution of probabilities $P_n(k;p(n))$ that corresponds to (1)
converges to the Poisson distribution $P(k)=\frac{\mu^k}{k!}\,e^{-\mu}$.
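

The convergence of the generating functions in (1) can be checked numerically. The following is a minimal sketch, assuming the simplest admissible choice $p(n)=\mu/n$ (that is, $\varepsilon_n=0$); the function names and the grid of $z$ values are illustrative and not part of the original text.

```python
import math


def binomial_pgf(z, n, p):
    """PGF of the number of events E in n independent Bernoulli trials."""
    return (1.0 - p + p * z) ** n


def poisson_pgf(z, mu):
    """PGF of a Poisson distribution with mean mu."""
    return math.exp(mu * (z - 1.0))


def max_pgf_gap(n, mu, grid=21):
    """Largest |g_n(z) - g(z)| over a grid of z in [0, 1], with p(n) = mu / n."""
    zs = [i / (grid - 1) for i in range(grid)]
    return max(abs(binomial_pgf(z, n, mu / n) - poisson_pgf(z, mu)) for z in zs)


if __name__ == "__main__":
    # The gap should shrink roughly like 1/n.
    for n in (10, 100, 1000, 10000):
        print(n, max_pgf_gap(n, mu=2.0))
```

The gap is measured only on $0\le z\le 1$, which is enough to illustrate the limit; it is not a proof of convergence of the probabilities themselves, which rests on the theorem cited as [1].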


The Polya-Aeppli distribution as the asymptotic distribution of rare events along a 
sequence of homogeneous Markov chains 


Now, let us consider a succession of Bernoulli trials with first-order Markov
dependence (a Markov chain with two elements in its state space).


In this case the probability of occurrence of $E$ in trial number $n$ depends on the
result of trial number $n-1$. Either directly, as was done by Cernuschi and Saleme [2],
or indirectly, as was done by Bernstein [3], it is possible to calculate the probability
generating function $g_n(z)=\sum_{k=0}^{n}P_n(k)\,z^k$ corresponding to the random number of
occurrences $N_n$ of the event $E$ in $n$ consecutive trials along the chain. If $p$ is the
probability of the event $E$ in a trial given that $E^c$ occurred in the previous trial, and $p'$ is the
probability of $E$ given that $E$ occurred in the previous trial, then the following formula is
obtained ([2], [3]):

$$g_n(z)=\frac{\bigl(b(z)-\lambda_2(z)\bigr)\,\lambda_1(z)^n-\bigl(b(z)-\lambda_1(z)\bigr)\,\lambda_2(z)^n}{\lambda_1(z)-\lambda_2(z)} \tag{2}$$


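The following equations hold:

$$\lambda_1(z)=\frac{c(z)+\sqrt{c(z)^2-4\,a(z)}}{2} \tag{3a}$$

$$\lambda_2(z)=\frac{c(z)-\sqrt{c(z)^2-4\,a(z)}}{2} \tag{3b}$$

$$a(z)=(p'-p)\,z \quad (3c) \qquad b(z)=1-p\,(1-z) \quad (3d) \qquad c(z)=1-p+p'\,z \quad (3e)$$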
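

A closed form of this kind can be cross-checked against a direct step-by-step propagation of the chain. The sketch below assumes that the chain starts as if $E^c$ had preceded the first trial (which is consistent with $g_1(z)=b(z)=1-p\,(1-z)$), and that the closed form is the usual two-root expression built from the roots $\lambda_1,\lambda_2$ of $\lambda^2-c(z)\,\lambda+a(z)=0$. Both are assumptions of this illustration, and the function names are hypothetical.

```python
import cmath


def pgf_by_recursion(z, n, p, p_prime):
    """g_n(z) obtained by propagating the two-state chain one trial at a time.

    not_e: weight of paths whose last trial was the complement of E
    e:     weight of paths whose last trial was E (each E contributes a factor z)
    Assumption: the chain starts as if the complement of E preceded trial 1.
    """
    not_e, e = 1.0, 0.0
    for _ in range(n):
        not_e, e = (not_e * (1 - p) + e * (1 - p_prime),
                    (not_e * p + e * p_prime) * z)
    return not_e + e


def pgf_closed_form(z, n, p, p_prime):
    """Two-root closed form with a, b, c as in (3c) to (3e)."""
    a = (p_prime - p) * z
    b = 1 - p * (1 - z)
    c = 1 - p + p_prime * z
    root = cmath.sqrt(c * c - 4 * a)  # complex sqrt guards against rounding below zero
    lam1, lam2 = (c + root) / 2, (c - root) / 2
    value = ((b - lam2) * lam1 ** n - (b - lam1) * lam2 ** n) / (lam1 - lam2)
    return value.real


if __name__ == "__main__":
    for z in (0.0, 0.3, 0.7, 1.0):
        print(z, pgf_by_recursion(z, 25, 0.1, 0.6), pgf_closed_form(z, 25, 0.1, 0.6))
```

The closed form divides by $\lambda_1-\lambda_2$, so it is only usable when the two roots differ; for $0<p$, $p'<1$ and real $z\ge 0$ the discriminant is strictly positive.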


The mean number of occurrences of $E$ in $n$ trials, $\mu_n$, is obtained from formula (2) for
the probability generating function and the relation $\mu_n=g'_n(1)$ (here $g'_n(1)$ is the
derivative of $g_n(z)$ taken at $z=1$).


Thus we obtain:

$$\mu_n=\frac{n\,p}{1-(p'-p)}-\frac{p\,(p'-p)\,\bigl[1-(p'-p)^n\bigr]}{\bigl[1-(p'-p)\bigr]^2} \tag{4}$$


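We can increase $n$ without bound while maintaining $\mu_n=\mu_\infty$ constant, if the
probabilities $p$ and $p'$ verify:

$$p=p(n)=\frac{1}{n}(\mu+\varepsilon_n) \quad (5a) \qquad p'=p'(n)=\rho+\varepsilon'_n \quad (5b)$$

Both $\varepsilon_n$ and $\varepsilon'_n$ tend to zero while $n$ tends to $\infty$. Besides, $\mu\ge 0$ and $0\le\rho<1$.


From (4), (5a) and (5b) we obtain:

$$\lim_{n\to+\infty}\mu_n=\frac{\mu}{1-\rho} \tag{6}$$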
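

The behaviour of the mean can be illustrated numerically. The sketch below assumes the chain starts as if $E^c$ had preceded the first trial, so that the probability $\pi_k$ of $E$ at trial $k$ obeys $\pi_k=p+(p'-p)\,\pi_{k-1}$ with $\pi_0=0$. The closed form in `mean_closed_form` is derived from that recursion for this illustration rather than quoted from the original, and the choice $p=\mu/n$, $p'=\rho$ (both error terms set to zero) is likewise an assumption.

```python
def mean_by_marginals(n, p, p_prime):
    """Sum of P(E at trial k) for k = 1..n (complement of E assumed before trial 1)."""
    prob_e, total = 0.0, 0.0
    for _ in range(n):
        prob_e = p * (1 - prob_e) + p_prime * prob_e
        total += prob_e
    return total


def mean_closed_form(n, p, p_prime):
    """Closed form of the same sum, with d = p' - p."""
    d = p_prime - p
    return n * p / (1 - d) - p * d * (1 - d ** n) / (1 - d) ** 2


if __name__ == "__main__":
    mu, rho = 2.0, 0.5
    for n in (10, 100, 1000, 100000):
        # With p = mu / n and p' = rho the mean should approach mu / (1 - rho).
        print(n, mean_closed_form(n, mu / n, rho), mu / (1 - rho))
```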


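Substituting (5a) and (5b) in formulae (3c) to (3e), it follows that (here, as
usual, $o\!\left(\frac{1}{n}\right)$ is such that $\lim_{n\to+\infty} n\cdot o\!\left(\frac{1}{n}\right)=0$):

$$a_n(z)=\left(\rho-\frac{\mu}{n}\right) z+o\!\left(\frac{1}{n}\right) \tag{7a}$$

$$b_n(z)=1-\frac{\mu}{n}\,(1-z)+o\!\left(\frac{1}{n}\right) \tag{7b}$$

$$c_n(z)=1+\rho\,z-\frac{\mu}{n}+o\!\left(\frac{1}{n}\right) \tag{7c}$$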


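Then, from (3a), (3b), (7a) and (7c):

$$\lim_{n\to+\infty}\lambda_1(z;n)=\lim_{n\to+\infty}\frac{c_n(z)+\sqrt{c_n(z)^2-4\,a_n(z)}}{2}=1 \tag{8a}$$

$$\lim_{n\to+\infty}\lambda_2(z;n)=\lim_{n\to+\infty}\frac{c_n(z)-\sqrt{c_n(z)^2-4\,a_n(z)}}{2}=\rho\,z \tag{8b}$$

From (8a) and (8b):

$$\lim_{n\to+\infty}\bigl(\lambda_1(z;n)-\lambda_2(z;n)\bigr)=1-\rho\,z \tag{9}$$

From (7b), (8a) and (8b):

$$\lim_{n\to+\infty}\bigl(b_n(z)-\lambda_1(z;n)\bigr)=0 \tag{10a}$$

$$\lim_{n\to+\infty}\bigl(b_n(z)-\lambda_2(z;n)\bigr)=1-\rho\,z \tag{10b}$$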


Then, from equation (2) and (8) to (10), it follows that the probability generating
function verifies:

$$\lim_{n\to+\infty}g_n(z)=\lim_{n\to+\infty}\bigl(\lambda_1(z;n)\bigr)^n \tag{11}$$


Now, from (3a), (7a) and (7c) we obtain, after several rearrangements and a Taylor
series expansion in powers of $\frac{1}{n}$:

$$\lambda_1(z;n)=1+\frac{1}{n}\left\{\frac{\mu\,(z-1)}{1-\rho\,z}+\delta_n\right\} \quad (12) \qquad \lim_{n\to+\infty}\delta_n=0 \quad (13)$$


From (11), (12) and (13) we see that $g_n(z)$ is asymptotically equivalent to $(\lambda_1(z;n))^n$
and both approach:

$$g(z)=\lim_{n\to+\infty}\left[1+\frac{1}{n}\left\{\frac{\mu\,(z-1)}{1-\rho\,z}+\delta_n\right\}\right]^n=\exp\left\{\frac{\mu\,(z-1)}{1-\rho\,z}\right\} \tag{14}$$

This is the probability generating function of the Polya-Aeppli distribution [4].
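

The limit can be illustrated numerically by computing the generating function of the dependent trials directly and comparing it with the limiting form $\exp\{\mu(z-1)/(1-\rho z)\}$. This is a minimal sketch under stated assumptions: the chain starts as if $E^c$ had preceded the first trial, and the transition probabilities are taken as $p=\mu/n$ and $p'=\rho$ (both error terms set to zero). The function names are illustrative.

```python
import math


def markov_pgf(z, n, p, p_prime):
    """PGF of the number of events E in n trials of the two-state chain.

    Assumption: the chain starts as if the complement of E preceded trial 1.
    """
    not_e, e = 1.0, 0.0
    for _ in range(n):
        not_e, e = (not_e * (1 - p) + e * (1 - p_prime),
                    (not_e * p + e * p_prime) * z)
    return not_e + e


def polya_aeppli_pgf(z, mu, rho):
    """Limiting PGF exp(mu (z - 1) / (1 - rho z))."""
    return math.exp(mu * (z - 1) / (1 - rho * z))


def gap(n, mu, rho, z):
    """|g_n(z) - g(z)| for p = mu / n and p' = rho."""
    return abs(markov_pgf(z, n, mu / n, rho) - polya_aeppli_pgf(z, mu, rho))


if __name__ == "__main__":
    for n in (10, 100, 1000, 10000):
        print(n, gap(n, mu=2.0, rho=0.5, z=0.3))
```

Setting `rho` to zero makes the trials independent, and the same comparison then reproduces the Poisson limit of the introduction.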


Applying the same theorem used above in the case of independent Bernoulli trials, we
conclude that $P_n(k)$, for dependent Bernoulli trials of an increasingly rare event $E$
(increasingly rare because the probability of the event $E$ in a trial, given that $E^c$
occurred in the previous trial, tends to zero) with strong first-order Markov dependence
(strong because the probability of $E$, given that $E$ occurred in the previous trial, tends to $\rho>0$), converges
to a Polya-Aeppli distribution. If $\rho=0$, (14) reduces to the probability generating
function of the Poisson distribution.


The Polya-Aeppli distribution as a compound Poisson distribution


Equation (14) may be restated as a compound Poisson distribution:

$$g(z)=\exp\left\{\mu\left[\frac{(1-\rho)\,z}{1-\rho\,z}-1\right]\right\} \tag{15}$$

The distribution that corresponds to (15) is also known as the Poisson-Geometric distribution.


Equation (15) is the probability generating function of the distribution of a random variable $\eta_N$
which is a sum of a random number $N$ of equally distributed random variables $\xi_j$, if
$N$ and $\xi$ are suitably distributed [1]:

$$\eta_N=\sum_{j=1}^{N}\xi_j \tag{16}$$

In general, the probability generating function of the distribution of a random variable
$\eta_N$ is the composition of the probability generating function of the variable $N$ with
the probability generating function of the variable $\xi$: $g_\eta(z)=g_N\bigl(g_\xi(z)\bigr)$.


In the Polya-Aeppli case, the random number $N$ has the Poisson distribution with
probability generating function $g_N(z)=\exp[\mu\,(z-1)]$, and the common distribution of
the summed variables $\xi$ is a geometric distribution with probability generating function
$g_\xi(z)=\frac{(1-\rho)\,z}{1-\rho\,z}$. So, when $n\to+\infty$, the distribution of the random number of
uninterrupted sequences of the event $E$ (with at least one event $E$ in each sequence)
along $n$ stages of the $n$-th Markov chain tends to a Poisson distribution with parameter
$\mu$, while the distribution of the random number of elements in each uninterrupted
sequence tends to a geometric distribution with parameter $\rho$.
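

The compound Poisson reading gives a convenient way to compute the probabilities themselves. The sketch below uses the Panjer recursion for compound Poisson sums, a standard technique that is not part of the original text, with geometric summands $P(\xi=j)=(1-\rho)\,\rho^{\,j-1}$ for $j\ge 1$. It then checks the result against the generating function $\exp\{\mu(z-1)/(1-\rho z)\}$. The function names and the truncation point `kmax` are illustrative assumptions.

```python
import math


def polya_aeppli_pmf(mu, rho, kmax):
    """P(k) for k = 0..kmax via the Panjer recursion for a compound Poisson sum.

    N is Poisson with mean mu; each summand is geometric on 1, 2, 3, ...
    with P(xi = j) = (1 - rho) * rho**(j - 1).
    """
    geom = [0.0] + [(1 - rho) * rho ** (j - 1) for j in range(1, kmax + 1)]
    pmf = [math.exp(-mu)]  # P(0): no sequences of E at all
    for k in range(1, kmax + 1):
        s = sum(j * geom[j] * pmf[k - j] for j in range(1, k + 1))
        pmf.append(mu * s / k)
    return pmf


def pgf_from_pmf(pmf, z):
    """Evaluate sum_k P(k) z^k for a (truncated) probability vector."""
    return sum(prob * z ** k for k, prob in enumerate(pmf))


if __name__ == "__main__":
    mu, rho = 2.0, 0.5
    pmf = polya_aeppli_pmf(mu, rho, 200)
    print("total mass:", sum(pmf))
    print("mean:", sum(k * prob for k, prob in enumerate(pmf)), "vs", mu / (1 - rho))
    print("pgf at 0.3:", pgf_from_pmf(pmf, 0.3),
          "vs", math.exp(mu * (0.3 - 1) / (1 - rho * 0.3)))
```

The recursion costs on the order of `kmax` squared operations, which is negligible for the small ranges of interest here.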


Conclusions 


(1) Given a sequence of two-dimensional and homogeneous Markov chains with matrix

$$\begin{pmatrix} p'(n) & p(n) \\ 1-p'(n) & 1-p(n) \end{pmatrix}$$

of transition probabilities for the $n$-th chain, such that the
transition probabilities verify equations (5a) and (5b), the random number $N_n$ of
events $E$ in the $n$-th chain tends to be distributed with a Polya-Aeppli distribution with
parameters $\mu$ and $\rho$ when $n\to+\infty$.


(2) It could be interesting to extend this kind of analysis both to homogeneous
m-dimensional Markov chains and to non-homogeneous Markov chains.


(3) If the scheme is interpreted temporally, as $n\to+\infty$ it can be assumed that the
length of the time interval separating two consecutive steps tends to zero. Then the
sequences of uninterrupted events $E$ can be interpreted, in the limit, as multiple events
[5].